PA Clan
   HOME

TheInfoList



OR:

The PA clan ( Proteases of mixed nucleophile,
superfamily SUPERFAMILY is a database and search platform of structural and functional annotation for all proteins and genomes. It classifies amino acid sequences into known structural domains, especially into SCOP superfamilies. Domains are functional, str ...
A) is the largest group of
protease A protease (also called a peptidase, proteinase, or proteolytic enzyme) is an enzyme that catalyzes (increases reaction rate or "speeds up") proteolysis, breaking down proteins into smaller polypeptides or single amino acids, and spurring the ...
s with common ancestry as identified by
structural homology A protein superfamily is the largest grouping (clade) of proteins for which common ancestry can be inferred (see homology). Usually this common ancestry is inferred from structural alignment and mechanistic similarity, even if no sequence similari ...
. Members have a
chymotrypsin Chymotrypsin (, chymotrypsins A and B, alpha-chymar ophth, avazyme, chymar, chymotest, enzeon, quimar, quimotrase, alpha-chymar, alpha-chymotrypsin A, alpha-chymotrypsin) is a digestive enzyme component of pancreatic juice acting in the duodenu ...
-like fold and similar
proteolysis Proteolysis is the breakdown of proteins into smaller polypeptides or amino acids. Uncatalysed, the hydrolysis of peptide bonds is extremely slow, taking hundreds of years. Proteolysis is typically catalysed by cellular enzymes called protease ...
mechanisms but can have identity of <10%. The clan contains both
cysteine Cysteine (symbol Cys or C; ) is a semiessential proteinogenic amino acid with the formula . The thiol side chain in cysteine often participates in enzymatic reactions as a nucleophile. When present as a deprotonated catalytic residue, sometime ...
and
serine proteases Serine proteases (or serine endopeptidases) are enzymes that cleave peptide bonds in proteins. Serine serves as the nucleophilic amino acid at the (enzyme's) active site. They are found ubiquitously in both eukaryotes and prokaryotes. Serin ...
(different
nucleophiles In chemistry, a nucleophile is a chemical species that forms bonds by donating an electron pair. All molecules and ions with a free pair of electrons or at least one pi bond can act as nucleophiles. Because nucleophiles donate electrons, they are ...
). PA clan proteases can be found in
plants Plants are predominantly Photosynthesis, photosynthetic eukaryotes of the Kingdom (biology), kingdom Plantae. Historically, the plant kingdom encompassed all living things that were not animals, and included algae and fungi; however, all curr ...
,
animals Animals are multicellular, eukaryotic organisms in the biological kingdom Animalia. With few exceptions, animals consume organic material, breathe oxygen, are able to move, can reproduce sexually, and go through an ontogenetic stage in ...
,
fungi A fungus ( : fungi or funguses) is any member of the group of eukaryotic organisms that includes microorganisms such as yeasts and molds, as well as the more familiar mushrooms. These organisms are classified as a kingdom, separately from ...
,
eubacteria Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one Cell (biology), biological cell. They constitute a large domain (biology), domain of prokaryotic microorganisms. Typically a few micrometr ...
,
archaea Archaea ( ; singular archaeon ) is a domain of single-celled organisms. These microorganisms lack cell nuclei and are therefore prokaryotes. Archaea were initially classified as bacteria, receiving the name archaebacteria (in the Archaebac ...
and
viruses A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Since Dmitri Ivanovsky's 1 ...
. The common use of the
catalytic triad A catalytic triad is a set of three coordinated amino acids that can be found in the active site of some enzymes. Catalytic triads are most commonly found in hydrolase and transferase enzymes (e.g. proteases, amidases, esterases, acylases, lip ...
for hydrolysis by multiple clans of proteases, including the PA clan, represents an example of
convergent evolution Convergent evolution is the independent evolution of similar features in species of different periods or epochs in time. Convergent evolution creates analogous structures that have similar form or function but were not present in the last com ...
. The differences in the catalytic triad within the PA clan is also an example of
divergent evolution Divergent evolution or divergent selection is the accumulation of differences between closely related populations within a species, leading to speciation. Divergent evolution is typically exhibited when two populations become separated by a geog ...
of
active site In biology and biochemistry, the active site is the region of an enzyme where substrate molecules bind and undergo a chemical reaction. The active site consists of amino acid residues that form temporary bonds with the substrate (binding site) a ...
s in enzymes.


History

In the 1960s, the
sequence similarity Sequence homology is the biological homology between DNA, RNA, or protein sequences, defined in terms of shared ancestry in the evolutionary history of life. Two segments of DNA can have shared ancestry because of three phenomena: either a spec ...
of several proteases indicated that they were evolutionarily related. These were grouped into the chymotrypsin-like serine proteases (now called the
S1 family S1, S01, S.I, S-1, S.1, Š-1 or S 1 may refer to: Biology and chemistry * S1 nuclease, an enzyme that digests singled-stranded DNA and RNA * S1: Keep locked up, a safety phrase in chemistry * Primary somatosensory cortex, also known as S1 * Tega ...
). As the structures of these, and other proteases were solved by
X-ray crystallography X-ray crystallography is the experimental science determining the atomic and molecular structure of a crystal, in which the crystalline structure causes a beam of incident X-rays to diffract into many specific directions. By measuring the angles ...
in the 1970s and 80s, it was noticed that several viral proteases such as Tobacco Etch Virus protease showed
structural homology A protein superfamily is the largest grouping (clade) of proteins for which common ancestry can be inferred (see homology). Usually this common ancestry is inferred from structural alignment and mechanistic similarity, even if no sequence similari ...
despite no discernible sequence similarity and even a different nucleophile. Based on structural homology, a
superfamily SUPERFAMILY is a database and search platform of structural and functional annotation for all proteins and genomes. It classifies amino acid sequences into known structural domains, especially into SCOP superfamilies. Domains are functional, str ...
was defined and later named the PA clan (by the
MEROPS MEROPS is an online database for peptidases (also known as proteases, proteinases and proteolytic enzymes) and their inhibitors. The classification scheme for peptidases was published by Rawlings & Barrett in 1993, and that for protein inhibitor ...
classification system). As more structures are solved, more protease families have been added to the PA clan superfamily.


Etymology

The ''P'' refers to ''P''roteases of mixed nucleophile. The ''A'' indicates that it was the first such clan to be identified (there also exist the PB, PC, PD and PE clans).


Structure

Despite retaining as little as 10% sequence identity, PA clan members isolated from viruses, prokaryotes and eukaryotes show
structural homology A protein superfamily is the largest grouping (clade) of proteins for which common ancestry can be inferred (see homology). Usually this common ancestry is inferred from structural alignment and mechanistic similarity, even if no sequence similari ...
and can be aligned by structural similarity (e.g. with DALI).


Double β-barrel

PA clan proteases all share a core motif of two β-barrels with covalent catalysis performed by an acid-histidine-nucleophile
catalytic triad A catalytic triad is a set of three coordinated amino acids that can be found in the active site of some enzymes. Catalytic triads are most commonly found in hydrolase and transferase enzymes (e.g. proteases, amidases, esterases, acylases, lip ...
motif. The barrels are arranged perpendicularly beside each other with hydrophobic residues holding them together as the core scaffold for the enzyme. The triad residues are split between the two barrels so that
catalysis Catalysis () is the process of increasing the rate of a chemical reaction by adding a substance known as a catalyst (). Catalysts are not consumed in the reaction and remain unchanged after it. If the reaction is rapid and the catalyst recyc ...
takes place at their interface.


Viral protease loop

In addition to the double β-barrel core, some viral proteases (such as TEV protease) have a long,
flexible Flexible may refer to: Science and technology * Power cord, a flexible electrical cable. ** Flexible cable, an Electrical cable as used on electrical appliances * Flexible electronics * Flexible response * Flexible-fuel vehicle * Flexible rake rec ...
C-terminal loop that forms a lid that completely covers the substrate and create a binding tunnel. This tunnel contains a set of tight binding pockets such that each side chain of the substrate peptide (P6 to P1’) is bound in a complementary site (S6 to S1’) and specificity is endowed by the large contact area between enzyme and substrate. Conversely, cellular proteases that lack this loop, such as
trypsin Trypsin is an enzyme in the first section of the small intestine that starts the digestion of protein molecules by cutting these long chains of amino acids into smaller pieces. It is a serine protease from the PA clan superfamily, found in the dig ...
have broader specificity.


Evolution and function


Catalytic activity

Structural homology A protein superfamily is the largest grouping (clade) of proteins for which common ancestry can be inferred (see homology). Usually this common ancestry is inferred from structural alignment and mechanistic similarity, even if no sequence similari ...
indicates that the PA clan members are descended from a common ancestor of the same fold. Although PA clan proteases use a catalytic triad perform 2-step nucleophilic catalysis, some families use
serine Serine (symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − form un ...
as the
nucleophile In chemistry, a nucleophile is a chemical species that forms bonds by donating an electron pair. All molecules and ions with a free pair of electrons or at least one pi bond can act as nucleophiles. Because nucleophiles donate electrons, they are ...
whereas others use
cysteine Cysteine (symbol Cys or C; ) is a semiessential proteinogenic amino acid with the formula . The thiol side chain in cysteine often participates in enzymatic reactions as a nucleophile. When present as a deprotonated catalytic residue, sometime ...
. The superfamily is therefore an extreme example of divergent enzyme evolution since during evolutionary history, the core catalytic residue of the enzyme has switched in different families. In addition to their structural similarity,
directed evolution Directed evolution (DE) is a method used in protein engineering that mimics the process of natural selection to steer proteins or nucleic acids toward a user-defined goal. It consists of subjecting a gene to iterative rounds of mutagenesis (cre ...
has been shown to be able to convert a cysteine protease into an active serine protease. All cellular PA clan proteases are
serine proteases Serine proteases (or serine endopeptidases) are enzymes that cleave peptide bonds in proteins. Serine serves as the nucleophilic amino acid at the (enzyme's) active site. They are found ubiquitously in both eukaryotes and prokaryotes. Serin ...
, however there are both serine and
cysteine protease Cysteine proteases, also known as thiol proteases, are hydrolase enzymes that degrade proteins. These proteases share a common catalytic mechanism that involves a nucleophilic cysteine thiol in a catalytic triad or dyad. Discovered by Gopal Chund ...
families of viral proteases. The majority are
endopeptidases Endopeptidase or endoproteinase are proteolytic peptidases that break peptide bonds of nonterminal amino acids (i.e. within the molecule), in contrast to exopeptidases, which break peptide bonds from end-pieces of terminal amino acids. For this re ...
, with the exception being the S46 family of exopeptidases.


Biological role and substrate specificity

In addition to divergence in their core catalytic machinery, the PA clan proteases also show wide divergent evolution in function. Members of the PA clan can be found in
eukaryote Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bacte ...
s,
prokaryote A prokaryote () is a single-celled organism that lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Greek πρό (, 'before') and κάρυον (, 'nut' or 'kernel').Campbell, N. "Biology:Concepts & Connec ...
s and
virus A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Since Dmitri Ivanovsky's 1 ...
es and encompass a wide range of functions. In mammals, some are involved in
blood clotting Coagulation, also known as clotting, is the process by which blood changes from a liquid to a gel, forming a blood clot. It potentially results in hemostasis, the cessation of blood loss from a damaged vessel, followed by repair. The mechanism o ...
(e.g.
thrombin Thrombin (, ''fibrinogenase'', ''thrombase'', ''thrombofort'', ''topical'', ''thrombin-C'', ''tropostasin'', ''activated blood-coagulation factor II'', ''blood-coagulation factor IIa'', ''factor IIa'', ''E thrombin'', ''beta-thrombin'', ''gamma- ...
) and so have high substrate specificity as well as
digestion Digestion is the breakdown of large insoluble food molecules into small water-soluble food molecules so that they can be absorbed into the watery blood plasma. In certain organisms, these smaller substances are absorbed through the small intest ...
(e.g.
trypsin Trypsin is an enzyme in the first section of the small intestine that starts the digestion of protein molecules by cutting these long chains of amino acids into smaller pieces. It is a serine protease from the PA clan superfamily, found in the dig ...
) with broad substrate specificity. Several snake venoms are also PA clan proteases, such as
pit viper The Crotalinae, commonly known as pit vipers,Mehrtens JM (1987). ''Living Snakes of the World in Color''. New York: Sterling Publishers. 480 pp. . crotaline snakes (from grc, κρόταλον ''krotalon'' castanet), or pit adders, are a subfa ...
haemotoxin Hemotoxins, haemotoxins or hematotoxins are toxins that destroy red blood cells, disrupt blood clotting, and/or cause organ degeneration and generalized tissue damage. The term ''hemotoxin'' is to some degree a misnomer since toxins that damage ...
and interfere with the victim's blood clotting cascade. Additionally, bacteria such as ''
Staphylococcus aureus ''Staphylococcus aureus'' is a Gram-positive spherically shaped bacterium, a member of the Bacillota, and is a usual member of the microbiota of the body, frequently found in the upper respiratory tract and on the skin. It is often positive ...
'' secrete exfoliative toxin which digest and damage the host's tissues. Many viruses express their
genome In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ge ...
as a single, massive polyprotein and use a PA clan protease to cleave this into functional units (e.g.
polio Poliomyelitis, commonly shortened to polio, is an infectious disease caused by the poliovirus. Approximately 70% of cases are asymptomatic; mild symptoms which can occur include sore throat and fever; in a proportion of cases more severe s ...
,
norovirus Norovirus, sometimes referred to as the winter vomiting disease, is the most common cause of gastroenteritis. Infection is characterized by non-bloody diarrhea, vomiting, and stomach pain. Fever or headaches may also occur. Symptoms usually deve ...
, and TEV proteases). There are also several pseudoenzymes in the superfamily, where the catalytic triad residues have been mutated and so function as binding proteins. For example, the
heparin Heparin, also known as unfractionated heparin (UFH), is a medication and naturally occurring glycosaminoglycan. Since heparins depend on the activity of antithrombin, they are considered anticoagulants. Specifically it is also used in the treatm ...
-binding protein Azurocidin has a glycine in place of the nucleophile and a serine in place of the histidine.


Families

Within the PA clan (P=proteases of mixed
nucleophiles In chemistry, a nucleophile is a chemical species that forms bonds by donating an electron pair. All molecules and ions with a free pair of electrons or at least one pi bond can act as nucleophiles. Because nucleophiles donate electrons, they are ...
), families are designated by their catalytic nucleophile (C=
cysteine protease Cysteine proteases, also known as thiol proteases, are hydrolase enzymes that degrade proteins. These proteases share a common catalytic mechanism that involves a nucleophilic cysteine thiol in a catalytic triad or dyad. Discovered by Gopal Chund ...
s, S=
serine proteases Serine proteases (or serine endopeptidases) are enzymes that cleave peptide bonds in proteins. Serine serves as the nucleophilic amino acid at the (enzyme's) active site. They are found ubiquitously in both eukaryotes and prokaryotes. Serin ...
). Despite the lack of sequence homology for the PA clan as a whole, individual families within it can be identified by sequence similarity.


See also

*
Protease A protease (also called a peptidase, proteinase, or proteolytic enzyme) is an enzyme that catalyzes (increases reaction rate or "speeds up") proteolysis, breaking down proteins into smaller polypeptides or single amino acids, and spurring the ...
** cysteine- ** serine- ** threonine- ** aspartic- ** metallo- *
Catalytic triad A catalytic triad is a set of three coordinated amino acids that can be found in the active site of some enzymes. Catalytic triads are most commonly found in hydrolase and transferase enzymes (e.g. proteases, amidases, esterases, acylases, lip ...
*
Homology (biology) In biology, homology is similarity due to shared ancestry between a pair of structures or genes in different taxa. A common example of homologous structures is the forelimbs of vertebrates, where the wings of bats and birds, the arms of pri ...
*
MEROPS MEROPS is an online database for peptidases (also known as proteases, proteinases and proteolytic enzymes) and their inhibitors. The classification scheme for peptidases was published by Rawlings & Barrett in 1993, and that for protein inhibitor ...
*
Protein family A protein family is a group of evolutionarily related proteins. In many cases, a protein family has a corresponding gene family, in which each gene encodes a corresponding protein with a 1:1 relationship. The term "protein family" should not be c ...
*
Protein superfamily A protein superfamily is the largest grouping (clade) of proteins for which common ancestry can be inferred (see homology (biology), homology). Usually this common ancestry is inferred from structural alignment and mechanistic similarity, even if n ...
*
Protein structure Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, the monomers of the polymer. A single amino acid monomer ma ...
*
Structural alignment Structural alignment attempts to establish homology between two or more polymer structures based on their shape and three-dimensional conformation. This process is usually applied to protein tertiary structures but can also be used for large RN ...


References


External links


MEROPS
- Comprehensive protease database
Superfamily
- A database of protein folds {{Enzymes EC 3.4 Molecular evolution Proteases Protein superfamilies